System and method for speech-enabled call routing

ABSTRACT

A method of processing a call is disclosed. The method can transform speech input from a caller of a call into text and convert the text into an object and an action. The method determines a call destination based on the object and the action. The method can route the call to a destination when a caller is not in compliance with at least one business rule. The method can further route the call to the call destination when the caller is in compliance.

FIELD OF THE DISCLOSURE

The present disclosure relates generally to speech recognition systems and more particularly to speech recognition-enabled automatic call routing systems and methods.

BACKGROUND

Speech recognition systems are specialized computers that are configured to process and recognize human speech and may also take action or carry out further processes. Developments in speech recognition technologies support “natural language” type interactions between automated systems and users. A natural language interaction allows a person to speak naturally. Speech recognition systems can react responsively to a spoken request. An application of natural language processing is speech recognition with automatic call routing (ACR). A goal of an ACR application is to determine why a person is calling a service center and to route the caller to an appropriate agent or destination for servicing a request. Speech recognition technology generally allows an ACR application to recognize natural language statements so that the caller does not have to rely on a menu system. Natural language systems allow the caller to state the purpose of their call “in their own words.”

Natural language ACR systems attempt to interpret the intent of the caller based on the spoken language and then selects a routing destination. When a speech recognition system misinterprets the caller's intent, significant problems can result. A caller who is misrouted is generally unhappy. Misrouted callers realize that there has been a mistake with the automated phone service and often terminate the call by hanging up. If a caller does not hang up they will typically be routed to an operator who will attempt to manually route the call. Routing a caller to an undesired location and to human operators leads to considerable inefficiencies for a business. Most call routing systems handle a large volume of calls and, even if a small percentage of calls are abandoned, the costs associated with abandoned calls are significant. Accordingly, there is a need for an improved automated method and system of routing calls.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a simplified configuration of a telecommunication system;

FIG. 2 is a general diagram that illustrates a method of routing calls;

FIG. 3 is a flow diagram that illustrates a method of processing and routing calls;

FIG. 4 is a table that depicts speech input and mapped synonym terms; and

FIG. 5 is a table illustrating action-object pairs and call destinations relating to the action-object pairs.

DETAILED DESCRIPTION OF THE DRAWINGS

A method of processing a call is disclosed. The method can transform speech input from a caller of a call into text and convert the text into an object and an action. The method determines a call destination based on the object and the action. The method can route the call to a destination when a caller is not in compliance with at least one business rule. The method can further route the call to the call destination when the caller is in compliance.

In a particular embodiment, the system includes a call routing system that has a processor configured to convert speech input from a caller into an object and an action. The system has an action-object routing module configured to determine a first destination for the call based on the object and the action. The call routing system can route the call to a second destination when the caller is not in compliance with a business rule.

In another embodiment the communication system includes an acoustic model configured to accept speech input and to produce text and a semantic model coupled to the acoustic model for producing an action and an object responsive to the text. The system further includes a routing module table responsive to the semantic model to provide a routing destination based on the action and the object.

Referring now to FIG. 1, an illustrated communications system 100 that includes a call routing support system is shown. The communications system 100 includes a speech-enabled call routing system (SECRS) 118, such as an interactive voice response system having a speech recognition module. The communications system 100 includes a plurality of potential call destinations. Illustrative call destinations shown include service departments, such as billing department 120, balance information 122, technical support 124, employee directory 126, and new customer service departments 128. The system 100 includes a communication network 116 that receives calls from a variety of callers, such as the illustrated callers 110, 112, and 114. In a particular embodiment, the communication network 116 may be a public telephone network or may be provided by a voice over Internet protocol (VoIP) type network. The SECRS 118 may include components, such as a processor 142, a synonym table 144, and an action-object routing module 140. The SECRS 118 is coupled to and may route calls to any of the destinations, as shown. In addition, the SECRS 118 may route calls to an agent, such as the illustrated live operator 130. An illustrative embodiment of the SECRS 118 may be a call center having a plurality of agent terminals attached. Thus, while only a single operator 130 is shown, it should be understood that a plurality of different agent terminals or types of terminals may be coupled to the SECRS 118, such that a variety of agents may service incoming calls. In addition, the SECRS 118 may be an automated call routing system. In a particular embodiment, the action-object routing module 140 includes an action-object lookup table for matching action-object pairs to desired call routing destinations.

Referring to FIG. 2, an illustrative block diagram of the SECRS 118 is depicted. In this particular embodiment, the processor 142 in the SECR 118 includes an acoustic model 210, a semantic model 220, and an action-object routing table 230. The acoustic model 210 receives speech input 202 and provides text as its output 204, and the semantic model 220 receives text 204 from the acoustic model 210 and produces an action-object pair. The actions and objects in the action-object table are ordered or ranked according to a confidence level. A confidence level indicates how likely a particular action or object reflects a correct and useable caller instruction. A call can be routed based on the overall highest confidence level action and object in an action-object list in the routing table 230. An exemplary routing table is depicted in FIG. 5. Objects are listed on the left side of the chart and actions are listed across the top of the chart. If the action of “acquire” has the highest confidence level for an action and “DSL” has the highest confidence level for an object, then the call would be routed to extension “001” according to the intersection in the table of the action-object pair “acquire-DSL.” As a further example, when the action-object pair of “cancel-DSL” is selected, then the call would be routed to extension “002” according to the routing table. In the event that a confidence level for an action-object pair is below a predetermined level, the call may be routed to a human operator or agent terminal.

The routing table 230 can receive a selected action-object pair 206 from the semantic model 220 and produces a desired call routing destination 208. Based on the call routing destination 208, a call received at the SECRS 118 may be routed to a destination, such as billing department 120 or technical support service destination 124, as shown in FIG. 1. In a particular embodiment, the action-object routing table 230 may be a look-up table or a spreadsheet, such as Microsoft Excel™.

Referring to FIG. 3, an illustrative embodiment of a method of processing a call using an automated call routing system, such as the system of FIG. 1, is illustrated. The method starts at 300 and proceeds to step 302 where a speech input signal, referred to as an utterance, is received or detected. Information about the caller is also received, such as information that can identify the caller or the caller's account number. This information can be automatically provided or it can be requested using a voice prompt and received as part of the utterance or as touchtone input. Using phonemes, the received speech input is converted into a plurality of word strings or text in accordance with an acoustic model, as shown at steps 304 and 306. In a particular embodiment, confidence levels are assigned to word strings based on established rules and the content and coherency of the word string. Next, at step 308, the word strings are parsed into actions and objects. Objects generally represent nouns and adjective-noun combinations while actions generally represent verbs and adverb-verb combinations. The actions and objects are assigned confidence values based on how likely they are to reflect the intent of the caller.

Many possible actions and objects may be detected or created from the word strings. To aid in the selection of a particular action and object, a table such as a synonym table can be utilized, as shown in FIG. 4. The synonym table can store natural language phrases and map the phrases to a predetermined list of actions and objects. Textual conversion of the natural language spoken by the caller in the acoustic model can be compared to natural language phrases in the table based on the phonetic-to-text conversion. Using the synonym table, the system and method maps portions of the natural phrases to at least one action or at least one object.

Additionally, the synonym table can provide rules for distinguishing recognized actions and recognized objects from specific actions and specific objects. Generally, there may be numerous actions that map to the same destination and numerous objects that map to the same destination. For example, in FIG. 4, the recognized object “Named Services” can include specific objects, such as “Caller ID,” “Call Notes,” “Call Forward” and other features or services that a single call service department may process. Thus, when a call request creates a recognized action or recognized object from a class of possible actions or objects, the call can be routed based on the recognized and selected action or the recognized and selected object. The specific action or specific object can be stored and transmitted with the call such that a destination system can utilize the action or object to assist the caller and meet the caller's objectives.

Recognized actions and objects can be assigned a confidence level based on textual similarities between predetermined or known actions and objects and results of the phonetic conversion. Additionally, rules that are based on consistency between a recognized action or an object and known phonemes can also be utilized to determine the confidence level. The method selects a most probable action and object from a list of recognized objects and actions utilizing the confidence levels, as is illustrated in steps 310 and 312. The selected action and selected object are paired and a current routing destination is determined at step 314. The method can track the treatment of the call utilizing three variables known as prior routing destination, current routing destination, and future routing destination. The prior routing destination variable refers to a previous location having processed the call, the current routing destination refers to a present location that is processing the call, and the future routing destination variable refers to where the call may be routed for processing. The variables can be set and reset throughout the process, as the call is routed, possibly among call centers. Thus, after an initial action object selection, the variables may be as follows: future routing destination equals null; prior routing destination equals null; current routing destination equals action-object table selection results.

Using the caller information received, the method determines if the caller is violating or has violated any business rules at decision step 316. For example, if a caller is late on their payment, or has a voice message box that has been full for over two months, the method may determine that the caller has violated a business rule. When a business rule has been violated, the call can be routed to a destination or a call service department determined by the business rules, as shown in step 320. When the call is transferred to a call service department that can overcome the rule violation, the variable can be reset as follows: (future routing destination equals table results; current routing destination equals business rules table; prior routing destination equals null). The call is routed to the current routing destination at step 322 and the caller is encouraged to take action to comply with the business rule that has been violated. After an attempt has been made to comply with the violation, the variables are reset (prior routing destination equals current routing destination; current routing destination equals future routing destination; future routing destination equals null), at step 324. The method proceeds back to decision step 316 to determine if the caller remains in violation of at least one business rule. If the caller is not in violation of a business rule, then the call is routed to the caller's desired destination, at step 318, the call routing destination variables are reset, and the method ends at step 326.

The above-disclosed subject matter is to be considered illustrative, and not restrictive, and the appended claims are intended to cover all such modifications, enhancements, and other embodiments that fall within the true spirit and scope of the present invention. Thus, to the maximum extent allowed by law, the scope of the present invention is to be determined by the broadest permissible interpretation of the following claims and their equivalents, and shall not be restricted or limited by the foregoing detailed description. 

1. A method of processing a call comprising: transforming speech input from a caller of a call into text; converting the text into an object and an action; determining a first call destination based on the objects and the action; routing the call to a second call destination when the caller is not in compliance with at least one business rule.
 2. The method of claim 1, further comprising converting the text into a specific action and transmitting the specific action.
 3. The method of claim 1, further comprising converting the text into a specific object and transmitting the specific object.
 4. The method of claim 2, further comprising transmitting the specific object with the routed call.
 5. The method of claim 1, further comprising assigning a confidence value to the action and the object.
 6. The method of claim 1, further comprising assigning a confidence level to the action based on a probability that the action represents an intent of the caller, and assigning a confidence value to the object based on a probability that the object represents an intent of the caller.
 7. The method of claim 1, wherein the action is one of a verb and an adverb-verb combination.
 8. The method of claim 1, further comprising placing a caller in compliance with at least one business rule subsequent to routing the call to the second call destination and after the caller fulfills an obligation.
 9. The method of claim 8, further comprising routing the call based on the action and the object to the first call destination after placing the caller in compliance.
 10. The method of claim 1, further comprising routing a call based on a current destination.
 11. The method of claim 1, further comprising routing a call based on at least one of a current destination, a future destination, and a prior destination.
 12. The method of claim 1, further comprising assigning a current routing destination variable to a destination.
 13. The method of claim 1, further comprising setting a current routing destination to the call routing destination after the call is routed.
 14. The method of claim 1, further comprising setting a prior routing destination to the current routing destination.
 15. The method of claim 1, wherein the current routing destination is set to the call routing destination as determined by a selected action-object pair.
 16. A system comprising: a call routing system having a processor, the call routing system configured to convert speech input from a caller into an object and an action; an action-object routing module configured to determine a first destination for the call based on the object and the action; wherein the call routing system routes the call to a second destination when the caller is not in compliance with at least one business rule.
 17. The system of claim 16, wherein the first destination is based on at least one of a selected action and a selected object.
 18. The system of claim 16, wherein the call routing system routes the call based on at least one of a current routing destination, a future routing destination and a prior routing destination.
 19. The system of claim 16, wherein the call routing system sets a future destination to the first routing destination.
 20. The system of claim 16, wherein the call routing system sets a current routing destination to the second destination.
 21. The system of claim 16, wherein the call routing system sets a prior routing destination to the current destination after the call is routed to the second destination.
 22. The system of claim 16, wherein the processor uses phonemes to convert the speech input into a textual format.
 23. The system of claim 16, wherein the processor assigns a confidence level to the textual format.
 24. The system of claim 16, further comprising a synonym table accessible to the processor.
 25. A system comprising: an acoustic model configured to accept speech input and to produce text at its output; a semantic model responsive to the acoustic model to produce an action and an object in responsive to the text; a call routing module responsive to the semantic model to provide a routing destination based on the action and the object; and wherein the routing destination is determined based on at least one of a current routing destination, a future routing destination and a prior routing destination.
 26. The system of claim 25, wherein the routing destination is determined based on a selected action and a selected object.
 27. The system of claim 25, wherein the call routing module sets a future routing destination based on a selected action and a selected object.
 28. The system of claim 25, wherein the routing module sets a current routing destination to a routing destination after it is determined that the caller violates a business rule.
 29. The system of claim 25, wherein the call routing module sets a prior routing destination to the current routing destination after the call is routed to the routing destination. 